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33 



34 



SEQUENCE LISTING 



CI) CEMCIIAL INFORMATION i 

(H i ) NTTMMft OF CBQUEM CEfii 1 3 6 

<2\V> (3) IMFORMAfl l lOK FOR - SBQ IP -NQ* 1 1 



(i) S KQUBMCE GHAIUkOTCRIGTIGOl 
1*\\> (A) LDMCnn i 60 b aoo p a ig 
2> (B) TYPE i nuoloA a ft oiid 
(C) srafl l TP BP M B BS i si ng l e 
m TO P O LO TYt Hn e nr 



PA/A 



^4pp> <y1) saoggMCB PKaawah ceq id hoc i : 

CTCCTGCAGT TGGACCTGTG CCATGGCCGO CTGGGCCGCA TAGAATGGAA 
CAACTAAAGC 



HlO> (2) mSQBMftlMQH F O R CEQ 



• 2z 



(i) CDQUBHOO OltARAO g CniCTIOC l 
^1/1> (*) LDMGW: 92 ■ 

- (C) GTHANDBDNECG I 
(n) TTiPOT^rtY- nnVnwn 

MnTFrnrff typf- prntnin 

^joQy ("*) SB Q rewrr nsccniPTTnu i ssq id m q : 2t 

Met Ala Glu Glu Arg Pro Tyr Ala Cys Pro 
5 10 

Val Glu Ser Cys Asp Arg Arg Phe Ser Arg 
15 20 

Ser Asp Glu Leu Thr Arg His He Arg He 
25 30 

His Thr Gly Gin Lye Pro Phe Gin Cys Arg 
35 40 

He Cys Met Arg Asn Phe Ser Xaa Xaa Xaa 
45 SO 

Xaa Leu Xaa Xaa His Xaa -tag- Thr His Thr 
55 /aA. 60 

Gly Glu Lys Pro Phe Ala Cys Asp He Cys 
65 * 70 

Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg 
73 80 

Lys Arg His Thr Lys He His Leu Arg Gin 
85 90 

Lys Asp 



/ilo? (H) mronHAqion fob ceq i d n qi 3i 

<i) OEQUDHGE CHARACTER I OTIC 3. 
^W. (A) LENGTH t 26 base pairs A 
<Xf2> (»»WI»YPTi n u n! n ig sold PA/* 



"(G) CTIlANPEPNECC i s i n g le 
(■D ) T O POLOGY ! lino SB 



TATGACTTGG ATGGGAGACC GCCTGG 



3x 



<U0> CWW . \j t A 

S/HICHEI-6AM2A ■ Isilro 



50 
60 



450 > t>CT/biK/ol 



m$C_ FEATURE 



±222? (ft) ... t$Q 



^4- 



422 1 > H1SC - FEA7KAB 

<2l$ > /cjfiL. ~ eu*^ ItMCUiP CukA 



Tr^> Mis c - F£A M&L 



4220 > 

Z?I7> H2SC -TEA7URR 



26 



< a > IllPOnjlATIOH FOR JB Q TP HQ; 4: 



35 
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( i) GEQUOWCD OILARACTDniOTIGO B 
^gj(> (A) LENGTH i 28 bane pair a 

(D) TYPEi nu c leic acid PflfA 



00 V i Lin e 

+A&0> ( w A) S EQ U E NC E P SfiQRIPT B IOU S E g I P MO l 4 s 

AATTCCAGGC GGTCTCCCAT CCAAGTCA 



28 



( i) OBQUBMGD GmWAOTEniGTIOG t 
<jj\> (ft) LEHGTH t 21 ba ee p air e ■ 
y PYP Si nuoJiolo a oi el p/v^i 

(D) TOPO LO GY i Unonr 

/AOQ^ - (Y1) SE Q UENCE DTSCRIPTI O Hi CEQ ID HO i 5 1 

TATATAGCGT GGGCGTATAT A 



21 



<x\o> 



( i ) C Eg UBMCB C H ARACTER I gTIC Si 
4^1 1> (A) LBWGfDH i 24 Luua puin 
yT[T>(B) gyPBi wwiaAq acid 
y' ,if < C ) STRAW P E P M Bfi 6 1 ningA e 
( P ) TOPOLOGY i l ino a g 



/ ^O0> (* i ) BBQUBWCB DBCOniTOI' 
GCGTATATAC GCCCACGCTA TATA 



6l 



I P M O 8 7: 



( i ) SEQ U E MC B C H ARA CTERISTICS r 
^2J|> (ft) LENGTH t 21 Lap p yedLB . i 
V>r>S (D) TYPE i nuelol e acid - VnA 
V (C) STOAMDBPMBfiS i a i ng l o ^ 



< D ) TOPOLOGY ! linear 

4$QQ> ( xi ) sequence p EgcmireroH * ceq 

TATATAGCGN NNGCGTATAT A 

(2 ) INFORMATION POn CEQ ID MO I 8x 

f i) 6EQUBWCE CHARACTER IST IC S « 
f H WWW > p ftJL c 



E>jjojfO a oid 



f6-) — 6*gRANDBONSS6 + -o ip gl c 

jJ^&Q? ( X i) OEQUSHCE DESCRIPTION! 0! 
GCGTATATAC GCNNNCGCTA TATA 



7i 



ML. 



8: 



21 



4 <A *.,c.Qe*t 



K i* A. , c, 



(2) information ron ogq id m* 9i 

(i) GEQUBN0I3 GllftnAGTEniCTIOC i 
^3fl> ( A > L E MCTH i 33 b— pnite ... 
^TTTT (B) TYPE i nucleic aoid J>A//l 



(C) STBAMDBDNES8 i single 
< P > - - T O P OLOG Y i JLA n aa g ■ 



^k^> S g Q UgMf:E DESCR I PT IO N! SEQ I P M O I 9t 

TTCCATGGAG ACGCAGAAGC CCTTCAGCGG CCA 



33 



101 



<2R> (ft) LENGTH I 33 b a oo pa A 
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(B) TYPEi nuo le io a oid - fWA 



(O) ■ 6TRAHDEDMBCG I oing l o 
<D) TOPOLOGY i lift oa t? 

^gg^ ( xi ) S EQ U E NCE DB fiCR I PTI O M i CBQ ID HO i 10 1 

TTCCATGGAG ACGCAGGTGA GTTCCTCACC CCA 33 

( i ) S EQ UEHCE - 
^3.1 1> (ft) LTNGTHi 33 baae paire » 
^T75> t* ) na olelo acid Vhln 

4r*^ (C) 6TRAHDEPNEC6 • oinglo 
(P) TOPOLOGY i HnB«r 

^4&Qj> < * di> GEQUBHQB PBGOniPWOH i GEO. ID «Ol Hi 

CCCCTITCTC TTCCAGAAGC CCTTCAGCGG CCA 33 



£l\Qs* ( 3 ) IMFORMATIOH FOB CBQ ID KO » 12l 
( 1 ^ WffWMa GKAHACTCniGTIOO 



Any 



k (A) LEMGTH : 33 amine a e i d a 
(B) TYPE: nm . lno a m i d - PITT 
" (G) OTRANDDDMEOOl 



( il ) MOLECULE TYPE * peptide 

^4Q9> (wi) CPQUuHCB . D B OCniPm^OM i GflQ ID HO i 12< 

Met Ala Glu Glu Lyo Pro Phe Gin Cya Arg 
5 10 

He Cya Net Arg Asn Phe Ser Asp Arg Ser 
15 20 

Ser Leu Thr Arg His Thr Arg Hie Thr Gly 
25 30 

Glu Lye Pro 

<2lQ> (2 > INFORMATION TOR ODQ ID HOi 13i 

(i) CEQUBNOB CltAnACT ER I S TI C S l 
<^\\> I A\ IiDMQTlI t 33 nmin«-&rAAo — 

^ - (O) gthakdepmegc i 



(li) H0LP0UU3 TVPD i pep tid e 
<#0£> ( ni ) fi BQUB MCB DB SC RI PT IO M l M 6j I P - M ttO L3f 

Met Ala Glu Glu Lye Pro Phe Gin Cya Arg 
5 10 

He Cye Met Arg Aan Phe Ser Glu Arg Gly 
15 20 

Thr Leu Ala Arg Hia Glu Lye Hie Thr Gly 
25 30 

Glu Lye Pro 

(g) IM FORMAT IOM FOR OCQ ID WO; 14: 

(i) OEQUDHCD GltARAOTEniCH C g » 
^ff? (A) LEMCTH i 27 amino aoida 
✓ (D) TYPE t amin e e c id VffJ 



(C) STRAND 
■ <D) TO P OLOGY * linea r 



(My MOLECULE TYPBt popHrlffi 
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^> ( M il) 6SQUBHGB DBGOniPglOlU CEQ ID H O I 14: 

Phe Gin Cya Arg lie Cya Met Arg Aan Phe 

5 10 

Ser Gin Gly Gly Asn Leu Val Arg His Leu 

15 20 

Arg His Thr Gly Glu Lye Pro 
25 



15: 



(i.) SE QUEN C E CHARACTER IGTIGO I 
4^112 (A) LBKGflH i 26 -a min e asid e 
S2I*7 W TYT?K- amino s old - PfiJ 

=*- (0) GTRAHDDDHEGG I 

(P) TOPOLOOV i unkno w u 

(ii) H0LE0PLD TYPO: p e p tide 

^4&Q> (ac i ) OBQUOHCE DEOCRIPTIOH. flEQ ID tiOl 

Phe Gin Cye Arg lie Cys Met Arg Asn Phe 
5 10 

Ser Gin Ala Gin Thr Leu Gin Arg His Leu 
15 20 

Lys His Thr Gly Glu Lye 
25 

41.1$ (fl) TMPOnHABaOM rOD GDQ ID HO: 16: 

(i) CEQUPKOD GIIARAGTERI0TIC3 - I 
SXl\> < ft > U3HOTH : 26 amin e a c ld a 
*T|3> < B ) TTOSi m Ab o as id . PrT 

V^ f ■ (G) fiTRAHDBDMEOO: 

( D) TOPOLOGY i uuXauwu 

(ii) MOLBOULP TYPO 8 pe ptide 

^4&0> (H i ) 6EQUBMGB DEGGRIPTIOHi OEQ ID WO I 16: 

Phe Gin Cys Arg lie Cys Met Arg Asn Phe 
5 10 

Ser Gin Ala Ala Thr Leu Gin Arg His Leu 
L5 20 

Lys His Thr Gly Glu Lys 
25 

j£l\Q> (3) IHFORMATIOH FOR 0DQ ID 1 1 0 1 17 1 

(a.) 6E QUEWCE 9BMftBWM6WW I 
Sllt> (A) UWWM 26 ai a i n e aoi do 

\2.> W PM4 ft ml no aoiri 

(C) fiT B A RPBDKSSG^ J 1 — " 

(D) TOPOLOGY* unhn eww 

(ii) M OLECULE mm* pop e ido 

/jfaQ ,> < * *) S EQUENCE DE fiC niPB IO M l CEQ ZD M P: 17: 

Phe Gin Cys Arg lie Cye Met Arg Asn Phe 
5 10 

Ser Gin Ala Gin Asp Leu Gin Arg His Leu 
15 20 

Lys His Thr Gly Glu Lys 
25 



ron cdq id no i 18: 



6,007,988 

41 42 

^continued 



6 EQ UEMGD OIIAHAOipDniOTIOO B 
LKUGTHi 89 - 



Oy?> (B) TYPTTi nmino acid Df}J 

' = (C) ST R ftH PB D M SCO i J " 

( D ) TOPOLOGY i unknown 

(ii) MOIiECULS TYPE: prortetn 

4&0V> jr\y fiffQITSKCS-DSSCRIPTIOMtn SEg ID MOt IBs 

Met Ala Glu Glu Lye Fro Fhe Gin Cye Arg 
5 10 

lie Cye Met Arg Aen Phe Ser Asp Arg Ser 
15 20 

Ser Leu Thr Arg Hie Thr Arg Thr His Thr 
25 30 

Gly Glu Lye Pro Phe Gin Cye Arg He Cye 
35 40 

Met Arg Aen Phe Ser Aep Arg Ser His Leu 
45 50 

Thr Arg Hie Thr Arg Thr HIb Thr Gly Glu 
55 60 

Lye Pro Phe Gin Cye Arg He Cye Met Arg 
65 70 

Aan Phe Ser Aep Arg Ser Aen Leu Thr Arg 
75 80 

Hie Thr Arg Thr Hie Thr Gly Glu Lys 
85 



4^0> (2) IWrOPJUWIOH TOR 0EQ ID «O i 19i 

(\\ RTtQmvr? rHftRftrnrBTSTTrst. 

£ju> (A) LDNQT Hi 10 amino aoido 
> 2j2> (D) TYPE i maiau aulil pgj 

(D) TOPOLOgyt un k n o wn ■ 

• Hi) MO LECULE BYPG i pe p tide 

( xi ) S E QUE M CE DE 6 CniI«PIOm CDQ ID MO: 19: 

Arg Ser Aep Hie Leu Thr Thr Hie He Arg 
5 10 

^jJ£Z t*i raroniWHOi i for seq id mo i 201 

<1J OEQUDHCO CnADACTIlRICTIOOt 

I ✓ 1 / n \ IMTMraUTI t 1 A 1 J J— 

y2t2> W TYPE 1 amino aai d PQX 

^— (G) fiTHAKDDDMDOO I Q 1 

( D) TOPOLQOYl unJuiuwu 

(H) MnT.Krnr.T: type: peptide 

> ( Xi ) SEQUEMG E B B C Cn IDT I Q UI SEQ ID NO I 20t 

Arg Val Aep Ale Leu Glu Ala Hie Arg Arg 
5 10 



(3) IHFORMATI ON-TO n OEQ - ID MO » 21: 

( i ) CE Q U S M CS CHA R ACT ERIS T ICS I 
^2| (>-4A4-LB»CTHt 10 <afaw auidu 
73j5> ( B ) TYPE i a min o ao i d fiOf 
^ ^fr-SSRAHPE DNE O O t « r , 

(D) TOPOLOGY! mduiuwu 



( ii ) MO LE C UL E TYPE i peptide 
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&!&0> ( l it) GDQUDNOC DCCOnjriJIOtl i CBQ IP MO i 21 1 

Gin Arg Ala Ser Leu Ala Ser His Met Arg 
5 10 




til etotewcs cHaRnrnrPTSTTrsT 
^ J \> (fl) r.ffMGTHr m amino aaida. 
y ^|2> ( B ) X¥P S-i-«n4nG-aeA4 P&T 

(D) TOPOLOGY ! unknown 

(&i) W OLBODLE ( B YPD i pept ide 

///gQ> (x l ) S E Q UENCE P ESCR I Pg lQ M i SEQ ID MO I 22t 

Asn Arg Aap Thr Leu Thr Arg His Ser Lys 
5 * 10 

{2)0^ ( 8 ) IMPORMATIOM TOR OBQ ID MO i 23 1 



■(j) SEQUENCE CHARACTERISTICS* 
/j//> (A) W WBh 10 - amino a oido 
^2I2> (B) TYPFi n i ami fwv^if.id— £j?T 



(D) TOPOLOGY l owJwewa 

MOL E CULB TV P D: p eptide 

^4&0^ < w i) GEQUBHOD DDOaniI"PIOM I CEQ ID HO I 23: 

Gin Lys Gly Hia Leu Thr Glu His Arg Lya 
5 10 



'2\0*> <2) IHPORMATIOM TOH CBQ ID MO i 24* 

(<) SFQUFianr rHttBarnmT.STTr.fi i. 

/ill> (A) LPMGTIU 10 . amino a c id g 

)p>(») TYPE i amino acid DO T 
S f . (0) CTRAHDEDWEGGl rn 1 1 

( D ) TOP O LOOTi uukuuwi 

(ii) M OL ECU LE TYPE l pepti de 

A^lOO^ ( w i » GEQUDMGE DDGQRITOIOHl GDQ ID MO 

Gin Ser Val Hia Leu Gin Ser Hia Ser Arg 
5 10 



^i|P> (S) It i rORMATIOM - FOR CEQ ID WO f. 25i 

4X\\> tA) LENGT H i 10 amine a e ido 

2> (D) TYPE; wnfato acid PkT 

(D> TOPOLOGY i ufOwow 



Jti $$*> - (Mi) CEQUEMCE . DBCCniP'g J OM l CEQ IP M C M 25 1 

Arg Leu Aap Gly Leu Arg Thr Hia Leu Lya 
5 10 

^2l(P^> < fi > IMPOnMAT-IOM STIR SFQ TT1 NflT 26 1 

fi-) GPQUONOD CHAnAOTEniCTIOO I 

( A) LEWGSHl aoidc 
*T j (1) TYPTi amino nMH p Q*T 

i 2 l * (C) SraANP E PM ESSl I ri ' 

(D) topology i unJcn evB- 



(ii) MOLBOULB IVPE i popt i d . 
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4]%QQ*> ( r x) OEQUBHCB DDOCRIPTIOHl GEQ ID HO I 26l 

Thr Pro Gly Asn Leu Thr Arg His Gly Arg 
5 10 

^}0y (2) iwrorotAttioM ron cbq » wo t 27 % 

( i ) 6B QVEMC S - CHARACTER i 
/^{T^ -W LBBOTH i 10 ' amino aoido 
^|^ y (B) TYPE i amino aoid pK C 

(D) TOPOLOGY ! unkno w n 

( 11) MOLECULE TYPDl p e p ti de 

££jg£ £ ( x i > SaQUBMO j PESCRIPTI Q Mi 6E Q ID M O I 27 1 

Asn Gly Gly Aan Leu Gly Arg His Met Lys 
5 10 

(S ) IMPORMATZOH FOR 3 E Q IP HOi 28: 

(1) SEQUENCE CHARACTER I ST I C S I 
<jJ/> (ft) MWg» 10 am i n o a oido _ 
777v> TYPFt a mino a rlrl flg | 
^^'^ (g) CTRAH P EPHEOBt ' — 

(D ) TOPOLOGY* tttfaW W B 

(11) HOLEQULB HYPB l popfei de 

jJ^QO^ (Xi) SEQUEWffg DgfiCBTPTTONl SEQ 10 MO I 28 1 

Arg Ala Asp Ala Leu Het Val His Lys Arg 
5 10 

^2\Q> ( 3 ) IMFORMAWM FO B S E g IP M Oi 29: 




"{ C) 3 T RAHPEDHE33 I 
fD) TOPOLOGY i unkn own 

«efc4.->-MQ LEOUIiD TYPD l peptide 



Asn Gin Ser Asn Leu Glu Arg His His Arg 
5 10 



■ g2/fl >< 3) niropjwjiOM ron odq id - hoi 30» 

- ( 0. ) SEQ U SMCE— GHAR ACTE R IgTICg t 

( A) " LEN G TH ; 10 amino acid s 
^y^lJ ^(D) TYTDa amin o aoid PfCi 

< h — (C) CTnA y p aroffleet J 
(D) TO P OLOGY ! unkn own 

(11) MOLECULE TYPBi peptide- 

Asp Arg Ser Asn Leu Glu Arg His Thr Arg 
5 10 



Hf&s" j 2 ) I HF O RMATIOH P OR OCQ ID HO l 31 1 

{L\ CCQUDNOII OIIAP AG TERICBIOC -f 
<^ti3LW LBMOXfti 10 amino-aelda 
^7» 1D> HYPD t amino ooid P/<H 
t O) 0TRAMDEDHD00 r " 
(D) TOPOLOGY i unkno w n — 



29i 



(ii) M O LECULE TYPE l popt id i 
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jdjQb^T ( Mi ) CEQU g MOC DBConzriiom obq t p mo. 31 S 

Arg Ser Aop Thr Leu Lye LyB His Oly Lys 
5 10 

^2/ 0 ( 3 ) iwo b mapkw r o n sea i d n o r 32s 

(i) SEQU E NC E C H ARACTERIST I CS! . 

_f ? ff> ( ft) fc BWCBH t 10 amino ■nlrtn 

(D) «»• amino aoiUU WJT 

(D) TOrOLOC¥ l Mimw 

(Ai) H O L S C UkB TYP E l popfc ido. 

<Jll ) ODQUm i GD DI10Cnira!IOU i M0130 - t D , HO 4 32: 

Gin Gin Ser Asn Leu Vol Arg His Gin Arg 
5 10 

^J0> (a> imtorhat iq m r o n cdqid HO l 33s 

^^■ll> ^u^LSKesm 10 amino ooirfo 

- {G ) ■ 4'i'ltAHDB DME30 1 
-fP-HPOPQLQQYi imVnnwn 

4ii) Mnr.Krm.T! tybr? ppintlrtn 

£4&)^ (Hi.) fiSQUBMOD OBOOniTOIOMl ODQ IP WO t 33s 

Asn Gly Ala Asn Leu Glu Arg His Arg Arg 
5 10 



^2.|0>-<g) IWrORMATIOH FOn - CDQ ID MO i 34 s 
( ij j SEQUENCE CHARACTER I ST ICS s 



Sf|>( ft) LEMQTIl l lO^Hoi* 



{Bf-VSP Bt amino aci d P^T 

(P) TOPOLOGY * unknown 

M O LBCPLg TIPM peptide 

iJjfiO^ (Kd.) 6EQUBN6D DBOGRIPPIOMl ODQ ID NO) 34 S 

Arg Glu Asp Ala Leu Thr Ser Hie Glu Arg 
5 10 

^>/c> ( 3 ) j raronMMion ron cdq id noe 35s 

( ■i) . SEQ OBWCB ..C H A 



^iLLX(A) LRNfTTHT 10 umino nciido - 



PCTinf7¥innintinwwn 
fit ) M OL E CU L E TYP E l popfrido 
(H i ) S EQUENCE DEGGniPHOi lt GEO , ID NO t 35 1 



Arg Gly Asp His Leu Lys Asp His lie Lys 
5 10 



^21 Qy (2 ) INFORMATION POR ODQ ID WO; 36s 

(L) SB QPEKCB CHARACTER 1 6 0? IC S I 
A Xi\> LEMCTK i 10 *m.ino soldo 
— rtyZ (D) TYPDt amin o a c i d PA»T 
/ , ^ ( C ) 8 T RAHDEDWE30 1 ' 
( P ) T O PO LO GY i unknown 
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6B QUE I TCE DBECRIPTI O Ml SEQ ID W O I 36 : 

Arg Oly Pro Aap Leu Ala Arg His Gly Arg 
5 10 

/2l0^ ( fl ) MWQBHflHOM WW CEO . » H O I 37 J 




CD CHARAOTDRIQEIOOp 
(A) LEMGTii I 10 ^ mino aeAda r — 
,(D) TYTB i amine a c id Pft [ - 

( C) GTRAND B BMEOO I 

(P) BPCOLOCatl ttnknrwim 

(id,) MOLECULE TYPE! pept i d e 

/A&0y ( M l) OBQUBMOB DDCOniPUOW l CBQ ID WO t 37 1 

Arg Glu Asp Val Leu He Arg His Gly Lya 
5 10 

y2/0> <B ) nironiiiwioM ron odq id ho i 38t 

< i 1 0CQUDHGH GIIARAgTBRIOTIOO i 
4 pl? <A1 LDMGttH i 10 nmlno ■nlrin 
j|» (P) T¥Tnt amin e a e ld 

( D) TOP O L O GY i un k n o wn 
( ii ) MOLECULE TVP fii p e pt ide 

jJ ^GO^ <tH) SBQUBMCt DESCBMBEIOM I STQ TD MH| 3B: 



Arg Ser Asp Leu Leu Gin Arg His Hi a Lys 
5 10 



( 2 ) mroRMATioH ron ODQ ID HO t 39* 

( i > 8EQUEHGE CUARACTI]RIOM e6< 
" *j> (A) LEHGTH u 10 nnrinn arMclg 
» ( B) TYPE i amino aci d P£T 
% (C) aTJU Mp g ow gaai — 1 

(P) TOPOLOOVi unknown 

(ii) HOLDOULB TY PE i pept id e 

/$<&D'> pet) S E QUENCE BE 3 CHI PT IOHl GBQ ID MO; 39l 

Arg Gin Asp Thr Leu Val Gly His Glu Arg 
5 10 



L2\0s> (3) I NFOR M ATI O N EO R S EQ IP M Ot 40: 
<j>||> (A) LBHCTH i 10 amino acid 



(i) S E QUENCE CHAIUWGTEniCHOO t 

Z??5> < B > TYPBi a mino ac id J^T 
^s?» .frit ranajiaragnuvcc . *" < "" ^ 



* ( 0 ) CTOAMBBDNEGG : 
(P ) TOPOLOOVi unknown 

(ii) M OLECULE ttVPEi popeid e 

( x i) ODQUDMCD DBQCRIPTIOM I ODQ ID MO I 40i 



Arg Ala Ala Aap Leu Asn Arg Hi a Val Arg 
5 10 



^Jq> ' < g > IH FORMATION FOR ODQ ID HO l 41 1 

(i) SSQUBMCS CHARACTERISTICS I 
^3fO (A) LEMCTH i 10 -am in e ae id a ; 
7>f3 >< a > TYP E i ami no a ei d f 
*=1 (C ) 6TRAMPEDWB93 1 — 

( P ) TOPOLO G Y t u nknown 
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/tl&O ( K i ) figQUBMGJB PBCGRWMCIH CBQ ID WO l 41} 

5er Gin Gly Aan Leu Gin Arg Hie Gly Arg 
5 10 

^2j0^ P) TM FORMAT IQ M T O R CBQ ED HO l 42: 

^>t> (ft> HWBft 10 aain e aoldo k t 
> ^)?> tBl TYffFt iminn eoAd j 

(D) TOPOLOGY : unknown 

(li) BOLSCttJ TYPBi popeido 

(x i ) SBQUBMCB DESCRIPTION ! g f Q » Mfti 42: 

Thr Gly Gly Ser Leu Ala Arg Hie Glu Arg 
5 10 




) Tnpnr.rvsvt .untawa — 

(ii) MOLECULD TYTDl p e p tide 

£J$£££> r < * l) CPQUQNGC DPOGRIPTlOMi SBQ ID H QS 43: 

Asp Hie Ala Asn Leu Ala Arg Hie Thr Arg 
5 10 

^J.l0^ (2) IHFORHATIOM FOR CBQ ID KO : 44) 

<i) OEQUEHCC CHARACTER 1 S T ICO l 
^ 5|)!^t A) LBWOTH*: 10 -aafai e— a eAda u 
~|J^tJ ) TYPE: a mino a old Pf( \ 

tD) TOPOLOGY j unkno wn 

(ii) MOLECULE TYrE i peptide 

iJ&O^ (*i) CEQUDKGD DBOORIPTIOM: ODQ ID NO I 44: 

Leu Gin Ser Aon Leu Val Arg Hie Gin Arg 
5 10 

<^K>> (3) IMFORMAfPIOH FOR CBQ IP MO i 45: 

(A ) S EQUE WC S CH ARA CTER TSTTPSf 
^2]/^ _( a ) L BMCTH ; 10 a min o aoida 
y J, TYTD i amin e a c id P/vf 

IC) DTRANDEDHEOO : 

(uj TDPQcocn unknown — 

1 (ii) MOLECULE TYPE : popeido - 

Jjfe&> ( *i ? SB Q fflWCB BB3GRIITI0H I ODQ ID HO I- 45: 

Arg Lye Asp Val Leu Val Ser Hie Val Arg 
5 10 

2i<0*> (2) IN FORMAT IOH FOR CCQ ID HO 




V^^> (D) TYPDt amin e aci d Pf< 1 
J^ - ■■■( C ) a i TU W DDDNDOO r 



i ^ i|_ ^ ^ ^MQLE( 
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( xl) OEQUBHCE DBO e RIF T t e i t i OBQ IP HO: 46t 



Arg Arg Asp Val Leu Net Aan His He Arg 
5 10 



M I P 0 PMAH 0 1 I TOR 8BQ ID WW 47i 

( 1 1 OCQUPBCD GllAnA0TI]niGTIOC > 
Qf )> (A) LEMGtt Hi 10 a»fa*Q aaldo 
2^72^ (0) PWQ ' amine a c i d pHj 



-(6f 

< » ) ■ TOPOLOGY i unkno 



(it) MOLBCtfrg TYPE! p e ptlrta 
^<4&£>>_ ("i-) CEQUEHGE P ES CRIPTIOM i CB Q ID M Qc 47: 



i?\o> 



Oln Oly Gly Aan Leu Val Arg Hie Leu Arg 
5 10 



<2) IWFORHAT IOH FOR ODQ ID MO: 48: 



' (e ) 0TRANDDDHEO08 
(D) TOPOLOGY i unknown 



(ii) MOLDCULD EVPDi pop e &do 

< * i) GBQUDMGD DDOORIPTIOH l CEQ IP HO C 48: 

Ser Arg Asp Val Leu Arg Arg His Asn Arg 
5 10 

Kl.\0^ (2) IMPORHAJIOH FOR CEQ ID HO: 49: 

(i) GEQOBKCE OHAnftCTEB Iff TICS : 
^ ylO lM LBXUSXH i 10 wiotMcida-^ 

— topology i cntaown 
( ii ) Mpucqf TYPE i pept i d e 

JjL^CQ^ -fw4^— S E Q UE M C E D Bg CfUPTION i CEQ ID MO I 49: 

Glu Lyo Ala Thr Leu Ala Arg Hi a Met Lys 
5 10 

<2/^> (fl) IMFOnMATI O W FO R 6E ft IP 110; 50: 

43r»-SSQOE«C E CHAn/lCTSnig-J iCfii 
^|)>W~tiBti8*ttr- 1 0 'tu nine e e ldo pRJ 

jyp^ t^) TYPE i a mino aoid ■ — 

*< f nT , (C) 3TRAUDSDNE3 3 I 

■■< D) ■ TOPOLOGY » ■■ unknown ■ 

T(li) MOLBOULB IVPE t pe p t id e 

< /AfQG> l" 1 ) WOBNOB DBOOniPiriOW l CEO ID MP I 50l 

Oln Ala Gin Thr Leu Gin Arg His Leu Lye 
5 10 

^,?/«?J> " ( 8 ) affOWAWQW BOB S8Q Tn MOi 51: 

<1) 0EQUDMOP OltAnAGTEIlIBTICS: 
<>IY> W LDMOTH » 10- a min o molds 

JT . ■ f . r ^l ( o ) GTHANDBDHE 66*- ™ 1 

(D) TOPOLOGY i unkn o wn 
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^tyiW!> OEQOHKOB D B CCB i TO I OM l CB g I P HO l 51 1 

lis Ala Ser Asn Leu Leu Arg His Gin Arg 
5 10 



(3) IWOmWMOM MB CBQ ID MO i 52: 

(L) sequence caAaacaaaxsaicsj 

<AV LDHOTH i 10 amin e aoAdo 
J*L)j&m TYPFt wminm sold PJ?T 
— "(G) 6TOAKDBDKH0D t — 

(P) TOPOLOGY i mriwewa 
(i i ) MOLECULE q?VPE » p ep e ldo 

(Yi) sfqtitwt rtFflrnTPfPTnTH nrp in ran? 521 

Arg Gly Asp Ria Leu Lys Asp His lie Lys 
5 10 

^2 /fl> , (3) raronMAT IO M FOR S SQ IP W Ol 53l 

( ly odqudngd oianAoroniOTioo i 

Z^M> (A) LHHGTHi 10 amin e as ide 
7)J2> (B) BWi a m i n e aeid QUI 
CTRAMDDmiDOOl 
■( p) TOrOLOCY i untoown 

( 11) MOWeWfl WPh pop fa id e 

^/fl gD> <xii ag e o Bwe c pooGniroioii i ceo i p ho* 53: 

Arg Ser Aap His Leu Thr Thr His lie Arg 
5 10 

<2\&/* (31 IMfOWWTJOM TO W SF Q Tf> NfV 54 x 
t (ij ODQOBHOB < 

SB« 10 -aiaino ac 




te^STRA» DDDMDQ&; 

yfry-TO p e iiOOYi unkno w n 

(LI) MOLECULE ■ popeldo 

(Mi) CDQUDHOD DOOOftlFTIOH: ACQ ID HO x 54 1 

Gin Leu Ala His Leu Ser Thr His Lys Arg 
5 10 

( 2 ) nas o a n Aw o w paw sbq » no t 55 1 

<i) SEQUENCE CHARACXE«g«C*t 
y^) /> -W=™ja«H> 10 amin e acid s 
<00 >( B > TYP E* arel ne « e *d- Pi?T 
- i<3 - ) • S TnAM PEPMgsc*» r r> * 

\M) TOPOCOOTi cntarown 

< ii) MOLECULE TYgflt p e p tide — 

^Jl&0> < » i) OEQUBKOB DESOniPTIOM l CBQ I P M O I 55 1 

Gin Ser Val His Leu Gin Ser His Ser Arg 
5 10 

^2/P> p} WPOWWMOM BOtUSJEQ Tr > 56: 



ni0Q?IO0 : 



<^ 2 ) | J>W~feBNG¥« : 10 umiiiu uu ±dt3K — 
7TTT^ (D) TYPB i amine aoAd- ?K\ 
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j/ll$0> (Hi> SBflWHea DDOORITTIOM i OPQ ID MQi 56i 

Gin Lye Gly Hie Leu Thr Glu Hie Arg Lye 
5 10 

(t) sequence characteristics! 
£ 2-1 t^ W tiEM gi m 10 amino acids 
^ XD ) T¥PD: amin e acid PtfT 

^T&) OTRAMDBDMBOQ I 

tD ) T O POLOG Y i mdmuwa 



LEOULB UTPE l pop t ide 

DDOORIPTIOM l OPQ ID HO t 57 1 

Gin Gly Gly Asn Leu Val Arg Hie Leu Arg 
5 10 

^llO^> < B) INFORMATION TOR ODQ ID HO I 58t 
4? I ^^4_£raAN©BBNB6e* 



4A@f£? ftri^-&EQUENCT-1»SCR«> TI O «l SE Q I P W O. 58 1 

Asn Gly Gly Asn Leu Gly Arg Hie Met Lys 
5 10 

■^2/C>!> INFORMATION FOn CEQ ID 110 1 59: 

<i) OEQUBNCP CHARACTER I 5 T ICS I 
^|1£ (A) LENGT H i 10 amin e a c i de- 
<^a >( D) TYPE i amino aeid PRT 
~" ~frC) S T RANDEDNE5S I ^ 
(D) TOPO LO GY i unfcnown - 

(ii) MOLECULE TYPP i pep t id e 

y p> (xi) SEQUENCE DESCRIPTION; SEQ ID NOT 59 1 

Ala Arg Ser Asn Leu Leu Arg His Thr Arg 
5 10 



[2) I NFORMATION 

< i ) S S QO BNfTg CH ft R A CTYRIST I C S I 
ZZM^* (*) T.rargrHi 10 amino adds 
?>f g;(D) TYPDi aroiuu acid PUT 
m (C ) 8TRA»DED tf E3S I 

(D) TOPOLOGY 



<H) Mnr.Kf.nT.T; typei pp.pt.lde 

^fo*0*> ( Ki ) gEQU E BCE DEGORIPTION i GDQ ID NO I 60 1 

Leu Gin Ser Aan Leu Val Arg His Gin Arg 
5 10 



^J0^> (3> INFOR MAT I ON F O R CEQ ID MO 61: 

( 1 > SE Q UENCE CHA R ACT E RI S TIC S : 
/jf l> (**-*BNOTH-r 10 -amino e oi ds 
7^g>W- TTPET am i no am id fitfT 
(C) STP A WDTf Ot i lg S S I ^ 
<P } ■ TOPOLOGY t - unknown 



<ii) MOLECULE JYBEl.peptad< 
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( II A) CEQUHMQD DDOORIBEIOH l ODQ ID MO I 6lt 



lie Ala Ser Asn Leu Leu Arg His Gin Arg 
5 10 

(3) DIFOnMflBIOM FOB GEO. ID WO » 62: 

(ii gequem c c cwAwucaaaamc&i 

^j)l> ( A) WWHh 10 nminp a e i ds 
^SJa X D) TYTDi amin e a e ad - pf(J 

(D) topology i unknown 

( ii ) M O L BCUL B TYPf- pflptlria 

Asp Arg Ser Asn Leu Glu Arg His Thr Arg 
5 10 

(i > OBQUDNCE OIIMlAOBBni 6 ai i C frl 
^| j> (A) LEMGB H-t 10 amino a eide 
TT»5>< D ) TYPE i amino aei d PkT 
"> *C (C) 3THAUDED M E30 i J M 1 

(P) TOPOT OGY t nnVnnwp., 

( ii ) MOL ECU L E TYPE I p a pt ido 
/jW&2L "Q" 86 8 P B g C niPttlOU l C E Q I P M O i 63: 

Aan Gin Ser Asn Leu Glu Arg His His Arg 
5 10 

(S) INFORMATION TOtl E Eft I D M O l 64l 

- ( i ) S SQ U B MGE CHAnACTEniCTIOO i 
<2_fl> ( A - ) - LDNOTH » 10 ami no a c ids 
Tm5 ?< b > TTOS e a m i n o so id ftffT 
M . t^ ,„ ( C ) BBRA KPBBHBapt — 

*if> ) topology i nnknown 

(ii) M 6 LD00LD TYPD t pe ptid e 

^ 400^ (Xi) SE tf t BWCE-&BS GniPIIOHi CDQ ID WO I 64 1 

Gin Gin Ser Asn Leu Val Arg His Gin Arg 
5 10 

^2(0^ (2) iMronmwiOH for ocq id wot 65i 

(i)6EQUBN0B OUARAOTCRI0TIG0 I 
||^ (A) LEMCTHc 10 amino acids 

( D ) TO FOL QGEi . unknown .. 
(11) HOLBOPL B- TOE i pep t i d e - 

Asn Gly Gly Asn Leu Gly Arg His Met Lys 
5 10 



4 2\Q> { 2 ) " ironMA * IOM P0R 3 g 0 IP MO i 661 

( C ) 3 T RA HP E DME 33 I ' 
■ (D) TOPOLOGY i unknown- 

4U4 MQLBCOL& TYPF* papfcids 
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OCQUDHGD DPOCRIPTIOHi OCQ ID HO I 66 1 



Asn Gly Ala Asn Leu Olu Arg His Arg Arg 
5 10 



( i > GDQUDHQD CIIARACTCRIOTICO I 
^j/j g tA) MHCTHi 10 amino aa.lrin 
^3i2> t P) T CTEl laidiiu mid pflT _ 

(D) TOrOLOCY i unknown , 

( ii ) H OLBOULE TYPD i peptide 

(yi)OEQUEMGE DEOCRIPTIOHi OEQ ID HO I 67l 



Ser Gin Gly Asn Leu Gin Arg His Gly Arg 
5 10 



j£X\Q > < c > DironitATiOM ron cdq id mo i 68: 

M s OEQUCI i eU CHARACTER I OTieOt 
/Xl\ P tt k \ UMCW l 10 ■Trn'nn ■ nfrif 
^jj^ X D) TYTDi amin o a c i d pflf 

- (C) 6TOAHD E PN S66+- "* 

(D) TOPOLOGY i unkn o wn 

<il) MOLECULE BffBi p e p tid e- 

-<!^PD> < * i) ODQUENGJ DDOCRIPTIOH; CDQ ID HO 

Ser His Pro Asn Leu Asn Arg His Leu Lys 
5 10 



^^\0^> -{ 2) INFORMATION FOn C E Q ID WO I 69 1 
^ij^S EQ UEHCE CHARACTER I OT IOO | 



Ml 

( P ) TOPOLOGY i unknown 
'(lii) MOLECULE TYPE ! popttdo 
4jt&0> ( x i) OCQODHCE PEaCHl P TIO H l 3EQ ID HO l 69: 

Thr Pro Gly Aen Leu Thr Arg His Gly Arg 
5 ' 10 

^2lfi> (S) IMPORHATIOH TOR CEQ ID HO t 70: 

(A) SS QWEHGE CHARACT E RI S TI CS i 
OJf> (A) WWW i 10 amino acids ■ 
qTI > < p ) TYPQ ' amin o acid- PR[ 
(O) OTDAHDED M DGO : 

( ii) HOLDOULD IYPDi p e p tide 

^{(flQ> f* i) OEQUDHCE DCOCRIPTIOHt 3E Q I D HO I 70l 

Asp Arg Ser Asn Leu Glu Arg His Thr Arg 
5 10 

Z^\Q^ ( 3 ) IWr ORM ATIOH foo CEQ ID MOt 71i 

<^/>J>t ft->-faEMOTHT- 1 0 sn&no arrrds 

75)^5^ >" T YPEl miduu «g*d P/^ T 
■ ■ ! 3TRA HPB DHE33 r ' 

-fO) — TOPOLOGY » unkn ow n 



-(-i-ir^ -MO LEG ULE TVPDb pap tidy . 
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<4fl0j^ ( M i) CDQPOHOD DDCOniPTIOMl ODQ - ID MO 

Asp Kis Ala Aan Leu Ala Arg His Thr Arg 
5 10 

^2|^> < a> niFO R MACTIOM FOR - OEQIP - MO: 72: 

< 1| 0EQUENC3 CHARACTERIflTIC&i 
^2tL ^(A) LBKOTttl 10 



^Sf5w TYPffr nmino ao&d— 

1**/ 'oiniinvsutuiww « 

(D) TO PO LOGY I unknown 

fil) MOfr B OOLB EYPD i p e pb A d o - 

jt$0Q!> OCQUDMGD DEOOnirTIOH l CBQ ID M Qi 72l 

Aep Arg Ser Ser Leu Thr Arg Mia Thr Arg 
5 10 

(2) XWgQR M AT IO M F O B ODQ ID MO : 73i 

(i> CEQUBNOB CIIARAOTDRIOTIOC : 

^^| ^ (D) TYPHI amino a c id p^>J 

( P ) TOP O LOG Y* unknown— 

- MOLDOUIiD TYPDi p e ptide 

4jyPQ^ (itl) CEQUENGE DB S CniDT a OM l BEG , IP I TO t 73 1 

Glu Arg Gly Thr Leu Ala Arg His Glu Lys 
5 10 

( L i GEQUEN G E ■ GHARAGTEniEII Cfi t 
Qjl> (ft) LENGTH* 10 iminn aelda — 
^1'^- (D) TYPEt aiuiiio acid I»T 

( D ) TO P OLOC V o. u nknown 

(H) MQLBCULE TYBF; peptide 

/JfySH)& (Ai ) BEQUBHGB D B flGRIPTIOHs ODQ ID HO l 74 1 



Asp Arg Arg Leu Leu Asp Arg His Gin Arg 
5 10 



(g > mrORMATIOH TOR OBQ IP MO. 75i 

(i> 0BQUEHCE CHARA CT ERICT1CO ii 
Of ! >('* )■ LENG T H i 10 arain p a aid a 

■ . , ^Tpi ) t ypb i amino acid J>f2~f 

C^f^X^n^ rnmiinnnniiT.nn . f ^ 

(D ) TO P OL O G Y I BrtOTOWB 

(ii) MOLECULE TYPE i peptid e 

4j fc0Qj> ( « *■ ■) - OEQUBMOB DPOGRIPTIOM l ODQ ID NOf 75 : 

Asp Arg Ser Ser Leu Thr Arg Hia Thr Arg 
5 10 

^i\0^ ( 3 ) PWOB M A I WOM FOn OBQ ID 110 1 76: 



(i ) SEgUg WC X CH ARACT E R Y ST IC O l 
^^\\^ i^h JiSNGTOi 10. amino .JMvida 

k|^^(D) TYPE b amin o-ttctd Pj&T" 
^H£^. tC7 OTRAHD E DMEJai -X*^ 
(D) TYTPfiTtOGY i nnlmnwn 



{LI) M OIiBOULB TYPH I pepeAaV 



6,007,988 

65 66 

-continued 



( l i t ) 6 EQUBHOB PPOOniPJIOHl ODQ 



Glu Arg Thr Ser Leu Ser Arg His lie Arg 
5 10 



(1) OEQUEHOE CWWACOTRXCaJCfci 

3. 

(O) GITIAUDBDWEGG I 



y ji ff * fV MWWH 10 wnAno aaidi 
(0) CTOAMDBDWBCC l. 

(D) wetewi Briwown 

( A A.) M OLECULE PWPE i p e p iid o 

;tJ\W> (Ml) OCQUDWOD DDOOniraiOll l ODQ ID l?Ot 77l 

Ser Ala Gly Thr Leu Val Arg His Ser Lys 
5 10 

^2f^> ( 9 ) raroruuvgioH ron odq id wet 78 1 

(l l 0DQUI1MOD - OnARAOTDRIOTIOOi 
<2jf^ (A ) MB M WM h 10 aataw ereAjto^ 

^* (C) GTRAHDDPHEOO t — 

' iOf ■ TOPOLOGY i ualtno wB 

( ii ) MOLE CU LE TY PEi p e pt i d e 

/jt£££i ( W l) OJQUDNOD DDOOniroiOM l ODQ ID HO I 78l 

Gin Ala Gin Thr Leu Gin Arg His Leu Lys 
5 10 

^j}0y ( fi ) D i rORMATIOH TOR ODQ ID HO i 79: 



( D ) TOPOLOGY « unknown 

(ii ) MOftBCTOE TTPgl poptldU 

ZJ ^PO^? (*M CEQUBNOB DECOniPTIOH l CEQ IP HQ I 79 l 

Glu Lye Ala Thr Leu Ala Arg His Net Lys 
5 10 

^3.fP^ < a> mroniiATioii tor odq id hoi 80: 

^Mj^ Jh) IiEMQTm 10 amine eei d a 
7T|>>W TYTB i amin e a c i d P/cT 

<ftK i. (G) cmrji p gptisgs i ' ~' ' 
(n) TflgOTjOGYi unknown 




typo i pepttfte 

•£./Jp#J> ( H i) 6EQUEWGE DBG GO I PT I ON I CEQ ID MO * 

Thr Gly Gly Ser Leu Ala Arg His Glu Arg 
5 10 

IH FORMAT IOM TOR ODQ ID »Q » 81: 

( 1 ) OEQUDHOJ OIIAnAOTCniCfJ I Cff I 
^| /> (ft) TffWf7THt 10 amino ■nldo - 
^gfrr r ( B ) TYP E i amino aoid PRj _ 
^»tT . |< (Q) OTRANDBDHDOO I n 



(il) M OLECULE T»Bl p e pLldi 
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^ J$b*> (niiS CEQUDNCT PBOORIPTIOH t - SIX) 
Arg Gin Ser Thr 



Arg Oln Ser Thr Leu Gly Arg Hie Thr Arg 
5 10 




-f<3) BTlUW DEB MEOGt 
-(• D) TOP O LOGV * unknown 



-tjAy-Meii COUiiP ^ P YP Di pept ide 
£$Q0*> ( xi ) SBQUEHCS DB SCBI PJ IO M * 6 BQ ID HQ I 82t 



Glu Lye Ala Thr Leu Ala Arg Hie Met Lye 
5 10 



( * ) S BQUBMGB - G i UVnftOTEniCTiOC l 

J2\\y (ft) LEMGTO i 10 -amine acida 

7^» ^(P) B YPE t ami no -a c id D/JT 

( C ) STRAMPBPMBE 6 » f P » 



STRAMPBPMBE fi I 



(D) tqpqlqcyi uiJmowii 

(11 > MOLECULE TYPE i peptide 
jlf4XX> (^1 ) SBQUMCB. DBSCaiPBIOMJ SBQ IP MOi 93 » 

Gin Ala Gin Thr Leu Gin Arg Hie Leu Lye 
5 10 

4^\0^ ( S ) m FORMA T I OII FOR ACQ ID MO l 84: 

i & ) B BQUBH B&^HARfte TDRIOTieO. 

tENGBTtt» 10 -a min e - a e ida — 

T>T5S(g»-OTFB i aminu aula PPT 

\D) TOPOLOGY i unkn ow n 

(11) MOLECULE TYPE i popfcide 

6E QUBH0B DBOORIPTIOM i CDQ ID HO i 84: 

Glu Arg Gly Thr Leu Ala Arg Hie Glu Lye 
5 10 

(3) IHrOHMATION FOR SE P , B P WO « 85i 

(i), OEQUDHGD OIUuTftCTBniOTaOC t 

Of/> *A) . L EM C T Hi 10 nminn nnirin 

v,?]^ (D ) mfP &t-wn teo - ttild ■ ACT 
f f e ') -aT RA HDDONDGG 3 -- J - " " 

fo r Tt iPO LOO T i ontotrown 

M O L ECUL E TY PE 

'< « !) ■ GEQUENCD DDGQRIPTK 

Gly Arg Asp Ala Leu Ala Arg Hie Oln Lye 
5 10 

^>fCL> (») TO FO RMA T I O N F OR 0DQ ID MO J 86t 

jkL- 




( i ) ■ ODQU B HOJ OHAIUWOTDn EOTI O O* 
g2jj> (A ) LEM OTH i 10 -am in o a e ida 
^2* y^* (B) OTPS i ■ mino acid 

^ 1 (C) 6TnANDBDMEC0 l 



(D) TOPOLOGY i unkno w n 



(ii ) MOLECULE TVPDi peptide 
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Arg Gly Pro Asp Leu Ala Arg His Gly Arg 
5 10 



87r 

SEQUENC E G HAflAOTEIttOB i EGg *- 
^^if> (A) LDNGTHt 10 amino soldo 
^^^>4B ) ^^^^^Lno- acid - 

(il ) MOLflOULI] TVTE i popfaido 

W|^K> ( W &) S EQUENCE DDOOniPmOHl ODQ, ID WO a 87: 

Ser Arg Asp Val Leu Arg Arg His Asn Arg 
5 10 

(fi) INFORMATION FOR 0CQ ID MO; 88: 

10 -a mino - s o l do - 
v>|2 V( D ) W* 8 * amino e o ed PJPT 
^ ^Q) omuaroPDNBcc *. US 

(D) TOPOLOGY i unkn o wn 

(11) MOLECULE TVrE i pop » ido 

^M&C? ( * *-) OCQUPNGD DDGGRirHOli l CEQ ID MQ e 88: 

Arg Arg Asp Val Leu Met Asn His He Arg 
5 10 



^i)0> (B) IMPORMAHPIOH FOn CEQ ID « 0» 89: 

j J t mmirawm «nmioiim«iTffniT«n ■ 

\ / "UI/VUUHUU" 'uiuuutuvuivirwtvvr 

(* ) L E WOT H: 10 amin o aciao 
^^^ (D) fPYPB : amin o aeid PfiT 



(P) TOPO L O G Y: un k nown 

(li) MOLECUL E TYPE: p o pt i do 

2 jj&0^?_ (Mi) SEQUENCE BBC OR I PT I OH l SEQ ID NO: 89: 

Arg Lya Asp Val Leu val ser Hie Val Arg 
5 10 

JL2\Q> t*1 IHPORMAmOM FOR CEQ ID II Q4 90: 

y^l^ 1 {A} '-6EIW*H-i 10' amino aoido — 
T^yg^CR) TYPE: nmlnn nnlri f>£T 

(G) GTnANDBDWBCO : 

(D) TOPOLOOVi unknown 

( »! )> MOMCtM TY P E: p o pt ido 

4jl&&> ( Xi) OflQUDNOD DCCOniPg aO M l 6 Eg I P MO: 90 1 

. Arg Arg Asp Val Leu Met Asn Hie He Arg 
5 10 

(8) INFORMATION TOR ODQ ID NO: 91: 

t O )- ^POPOLOQ¥ t- unkn ow n ■ 
TVPD: pe ptid e 
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( x i) GEQUDHGD DEOCItlPTION l CEQ IP HO I 91s 

Arg Glu Asp Ala Leu Thr Ser Hia Glu Arg 
5 10 

(2 ) INFORMATION FOR CEQ ID WO I 92: 

(i) SEQUENCE CIIARflffPBntSBIOC t 

(A) Lesoth o 10 -ag faw -a eAde — 
JB) TYPE i am i no acid pftT" 



—( D ) TOP O LOGY i un k nown 

( ii ) MOLECULE HPBt peptid e 

/jlW> S E QUENCE PBSCR I PTI OM : SSQ ID MO ; 92 1 

Arg Val Asp Ala Leu Glu Ala Hia Arg Arg 
5 10 

£Zl0^ -(-3-)— INFORMATION FOR fi B Q I P M Oi 93: 

■t ±1 SE Q UE NC E ClIARACTimiOTIOO l- 
^>/|>t A 7 LENGTHS 10 am ino ao ld a 
^p- ^B) TYPE i amin o aoid _^/^T 

(D) TOPOLOGY i unknown 

(ii) MOLECULE ffYPE l p o ptido 

( xi ) S E QUENCE DESCR IP TION : SEQ IP W Ol 93: 

Asp Arg Ser Ser Leu Thr Arg Hia Thr Arg 
5 10 

^^.[0> ( 3 ) INFORMATION FOR OSQ ID HO : 94: 



10 -amino arlrin 




Si 

( ■ D i ro P OLOQ¥ t— ttriBW W i 

(ii ) MOLECULE TYPE: pep tid e — 

^^0J> (* * ■) S EQUENCE DBGGRIPTIOH l CEQ ID MQ.t- 94 : 

Glu Arg Thr Ser Leu Ser Arg Hia lie Arg 
5 10 

£2.\()y (S) INFORMATION POn CEQ ID HQ * 95: 

(i) 0EQUEHCP CHA R ACTER 1ST ICS. 
^/f> (») LENGTH: 10 ■ mino amide 
T^i^ yfrD) TYPDi amin o aeid PflT 
^ (C ) S T RAM DEPH ESfl 8 

(D) TOPOLOGY i unknowa 

(ii ) MOLECULE TYPE I p e ptid e 

DDOORirTION l CEQ ID MO l 95 l 

Gly Ala Arg Ser Leu Thr Arg Hia Gin Arg 
5 10 



96: 




< i ) OEQUDNOD CHARACTER 1ST ICS I 
j > LENGTH: 10-nmlno anlrin 

(G) STRANPEDNKSSl . < I i » 
(P) TOPO t OCYi un k nown 



(ii) HttWWfl WWB< p o pt id i 
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CBQ IP HO I 9 6 t 



Thr Gly Gly Ser Leu Ala Arg Hie Glu Arg 
5 10 



ODQ ID 110 1 97i 



< i A- S E Q UEN CE CWtfWOMRtWtQfl i 
^{J> (A) LDMOTHi 10 amino tt e W » r 



( C ) 6B HAMDBDMEGC I 
( D ) TOPOLOOYi unkno w n 

j ii ) MO L ECU L E TYPE : p e pt i d e 

/4CQ^ ( » 1) &B QUDWOD DDOORIPTIOUt BDQ ID MO i 97l 



Gin Arg Ala Ser Leu Ala Ser Hie Met Arg 
5 10 



9H" P 0 tt " O DQ ID HO i 98 1 

(1) S EQUENCE Q HARft fi BBB SS TICfi i 
j X|_L> fftr-fcgtW TH* 10 - amino acida 



(D) TOrOLOGV i unfcn e wn 
(H) MOLECULE TYP E I popfrido - 
^_4flR> (M^) GDQUBHOD DflOtMIPTIOHi 8 



Asn Arg Aep Thr Leu Thr Arg Hie Ser Lye 
5 10 

Z^\0^ (3 ) INFORMA frie H TOR ODQ ID HO. 99: 




-(*>)— TOPOLeov i Brimwa 

< ii ) H OlJCtfL B-CWBi-pep+Ade.. 

/AfiO^ ^x&J-gBQBBMGB-B DGORIPTIOH: OBQ ID NO l 99 1 

Glu Arg Gly Thr Leu Ala Arg Hie Glu Arg 
5 10 

^/g> -W~n»FQRHM»e » - FOR OCQ ID W6l 100 1 

^^/Q - W LK M C THi 10 amino n rfjje^ 

^P^%G) !> TRA1»DSPttE6S-»- 

-ffrfr-TO P Q LOO Y t . i u n to o wn 

( 1 1) MO LECU L E W8 1 pept ide 

(x i > SEQUEMCE DWCMWIOm SE Q « HP i lOOl 

Arg Gly Asp Ala Leu Thr Ser Hie Glu Arg 
5 10 



3 ) tWF O RMAT I OH SOW OCQ ID HO i 101 I 

f i ) QEQUBWCB CHARACTER 1 0T ICO I 

^^P ^ ( B ) TYP E i amino a ol d- pfijf 
■ (■S t ) TOPOLOGY i nnVnnwn . 
t ii ) MOLCGULD TYPDi pup Li Jo 



6,007,988 

75 76 

-continued 

( * i) OBQUBMGB DESCRIPTION: fl E Q ID W O I 101 t 

Arg Ala Aep Ala Leu Met Val His Lys Arg 
5 10 

( fi ) inreRMATiow ro w sea id hoi 102 : 
(L\ gequdhob eatfaewMswcft 

sSj(S W bBMSWH 10 a min e ae ide 
^Z\X> (°) WPP » amin e e e i d- PfiJ~ 
— jC} -SSRWBBWIBW- -* 

( 11 ) MO L ECULE BYP B l pop* Ida 
/jl&O^* CBQUBWOB DECOniPHONa ODQ IP BOi 102 1 



Arg Gin Aep Thr Leu Val Oly Hie Glu Arg 

5 10 



<JjS> 



( 3 ) KIFORMMIOW FOR - CCQ ID - KO i 103: 

(i) S8QU E M C S C HARACT ER I S T I C S i 
42jj > (ft) MMCT H I 10 amino aoido 
TYPE i amin e a o i d POT 

• %D ) T O POLO G Y i mduiuwn 

(11) MOLBOULE TYPHI pep tide 

<^0t>> " ( x i) ODQUBNGD - PBOCItlPTJOM. 3 E Q IP W O I 103: 

Arg Qln Ser Thr Leu Val Arg His Thr Arg 
5 10 



ODQ ID HO I 104: 



<2f |J> -fA-) — LENGTH- : 10 -a min e a o i do 
^2|^^>< D > T¥PB: amiuu acid - " A 



(P) TOP O L O GY i unknown 
(ii) M Q LB C PUS TYPE: pop t ide 
/Jhff&& (xi) 6 EQU E KGE - D S ecniP^IOm CBQ IP HO l 104: 

Arg Ala Ala Aep Leu Aen Arg Hia Val Arg 

5 10 

< tJ2 (2) D i rORMA M ON IH)R ODQ IP HO i 105: 

(1) CEQOEWGE ■ GHAHAOTEni S JIC S I 
y^|(j> - (A) LEHOTH i 10 -amine aoido 
TTT^V ( B ) TYPE i ainluu auid - p#T 



(0) OTflAMPDPHDGO I 



/Jl&$> <*M fiyQHKWCK J)KBCBTPTIONy BBQ IP MOl 105: 

Arg Lya Aep Val Leu Val Ser His Val Arg 
5 10 

/^fi^ (a> njronMAGPi o M ron ceq ip h oi 1061 

( 0.) GEQUDNOE CIIAnAOTPniOPIOO i 
V^||S(*r--tjENOWfr 10 aroi»u auldu - 
j bl> S ( D > TY PET- amluu hi Id D&T 

(P ) T OP O UWYi unknu w n 



<H> Mnr.Rrnnr typf- peptide 



6,007,988 

77 78 

-continued 

<xl ) OEQUBHCE DBBflftlPTIONi OEQ ID NOl 106 l 



Arg Arg Asp Val Leu Net Aon His lie Arg 
5 10 



(i) ODQUDHCE GIIARACTER10TIC9 1 
w . ( A ) L B HOTHi 10 amin e aci de- 
yBj TYPHI amin e acid - J^fff 

(D) TOPOLOGY I unkwewn ■ 

(li ) MOIiCCULE IVPDi peptid e 

< *1) OEQUENCE DE3GRIPTIQH l ODQ ID HOt 107: 

Arg Ser Aop Thr Leu Lye Lye Hie Gly Lye 
5 10 

( 3 ) mr OBM AJ IO M r ow 6 E Q ID WO i 108* 



(i) OEQUBHCE CHARACTERISTICS; 
\\y (ft) LEWOSm 10 amino saidc 
Tv >(D) ' T¥rD i amin e a old - P^T 

( D) TOrOLOCY i unknown - 



2§> 



< ii ) MOLECULE TVPDi p ep tid e- 

< Al ) aEQUBWCB DEflCtUPTIOHr ODQ ID NO 

Arg Gly Pro Asp Leu Ala Arg His Gly Arg 
5 10 

(2) mrORHATIOH FOR ODQ ID HO i 109 1 

( i ) OEQU EHeE-eHAR ACTCRICTlCa. 
, ^}| > (A) LENGTH I 10 wiwtno wnidn * 
v"?T5> < D ) TYTE t amin e acid . PvT 

.( G) GqmAHDBDHEOC i ' - 



4jl&0^ -t**i-)— SE QUENCE DESCRIPTION i O E Q ID HO. 109: 

Ala Arg Glu Val Leu Gin Arg His Thr Arg 
5 " 10 



ORHATIOH POP . CDQ ID 1 1 0 > 110 > 

< 1 > SEQUENCE CHARACTIRIOTICO I 
42 LENGTH i lO^a mln o a c ids 

J>f2><P) TYPE i a min e acid _PfiT_ 
^ , -.- {0) GT RA WDK DMX S&l -#/M 
(D) TOPOLOGY I unknown — 

(id.) MOLBOULB TYPHI p e p tide 

/Ji&ti? < wi ) CEQ ttB NCD-DBOCRIPTION* OEQ ID NOl 110 1 

Arg Glu Asp Val Leu lie Arg His Gly Lys 
5 10 

^^ffy <a> iNFontuCTioM ron odq id no t nii 

)> t A ) "^O ffH ^^aSuu I auliiu 
yZ (U) T*PE i amimi aild D /> J" 
^•^fit^L^ ^Q ) CTRAHDBDNE66 1 1 ■ ' 
(H) TnPOT^GYi unknown 



(ii) MOLBQUTK TYPE* p e ptid e 
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-continued 



H*fcr) CBQUDMGD DDOGRIPTIOMl SB Q I P W O! lilt 

Arg Ser Asp Leu Leu Gin Arg Hi a His Lys 
5 10 



>^]0> (fl> IMrORMATIOM POR ODQ iD 1 1 0. 112; 



C H A R ACT E R I STI C S 1 
A\l>. HH MW6WH 10-enfaie e a ido 
CYTE i amino a o id P£T 
— — STB S I ■* n 
( 4? ) TOBO L O C Y i unkn o wn 

■ (ii) MOL B O U LE PYP Bi popt l do 

■ CSff U B MOE - DB fi CRIPttlO M I CEQ ID HO i 112 1 

Arg Leu Asp Gly Leu Arg Thr His Leu Lys 
5 10 

!^/0> ( S >) IKFOnKATIOM FOR ODQ ID HOi 113 1 

<i) 6S QUBKGB ■ OHARAOTDR IOT ICO I 
yyi\> W LBIW SH* 10 ami ne- a c i de- 
j^2> (a ) PCTS i a min e- - a c id P£7~ 

^ ^ mrnviT rw Pf* i,_ „ ^ 

<H.) MrnJCPM TYBFt peptide 

4jfop> fiia ) GCQUDtlOD BiMWPW«K ODQ IP «Oi 113: 

Arg Gly Asp Ala Leu Thr Ser His Glu Arg 
5 10 



<lij > (ft) L EMffTHi 10 ronino aQidB 
^gJg^(B) TYPFt amino aold J>fjJ 

(D) TOPOLOGY ■ unknown 

■ < i A) -MQ LBCSL E TYP Bi peptide ■■ 

4j\VQE? (Mi ) ODQU BNeB-PBa ORIPTIOH t ODQ ID HO I 114 1 

Arg Ala Asp Ala Leu Net Val His Lys Arg 
5 10 

( i ) ■ ODO BBtte B - CHARA CT ER! A T T Cfl> 
^ LEN G TH: 10 sn&n o ecidp 

qnfPC i amino asad f>fij 

■ (D) TO PO LOOV i unkn ow n 

( il) MObBCULE TYPBi peptide 

^jjfB^ tyjr"Ogy»w eP" D D o e Ri- PTioiii ccq id iioi 115 1 

Arg Val Asp Ala Leu Glu Ala His Arg Arg 
5 10 

^2]*Qs> (a > mroniwpiow ron esq id iio i 116 : 




C O P O L O CYt unXno 



6,007,988 
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-continued 

/J^QQ£? < xi) ODQUDWCD DDOOHIPTIOH b ODQ ID MO l 116l 



Arg Arg Asp Val Leu Lou Asn His lie Arg 
5 10 



y^Jfl> (S) IMPODHM f lON POP CDQ ID 



117t 



( i^ SE QUENC E C WAWLCWHWfiWCfti 
(A) MMflWH 10 a mino a oido 
► (D) TYPDi amfare uu td- pfjjj _ 
" "(0 ) CTOtAMDDPMDOO 1 
( D) TOPOLOOYi unknown 

( ii ) MO LECULE TYTE » pep tide 

j ^Qb^ l " L ) CDQU BH OI1 DDOCRIPTIOH i ODQ ID MO I 117t 

Arg Glu Asp Val Leu lie Arg His Gly Lys 
5 10 



t&) tw powwra o i a fo r ae g t p m o x U8i 

( & A ■ SE QUENCE OIIARAOTCRIOTIGO r 
<^ lf> (A) L SMGWt 10 amine ocida 

TrTs ^ B ) w n faw —44 r>/JT 

<^-\^r CTOAMDEDNDCO l ' 
■tD ) - TOPO LOG Y ! ttnJwBWt 

( ii ) MOLECULE TYPE ! pop e ido 

t-j lGO^? (yi) SBQMSMCB PKCMWMMl 6 E Q I P H P* 118 1 

Arg Ser Asp Leu Leu Gin Arg His His Lys 
5 10 



^2|(7J> < c ) ihpormatioh ron odq id hoi i 19 1 

(i) CDQUENOD QHA fi ftCTERIOTICa i 
i*) lpmqtii i 10 -mafcno auAds 

tPI -TOPO L O G Y I UllkllUWi l 

( ll )- HOMCUIiD TYPD i p ap tide 

^Cl[0Py (Mi) GE SttSKe B DESCRIPTION: 0CQ ID HPi 119 1 

Lys Ala ser Asn Leu val Ser His He Arg 
S 10 

j^jfQ> -( ^ - iWFomwpi ot i ron cbq id w >: 120 : 

ML 




(ii) MQLBOOLD TYPO i p e p Lido 

^fiOS^ ( Ki ) S E Q U E NC E DB SC RI PTI O M l S EQ IP M O l 120 1 

Leu Arg His Asn Leu Glu Thr His Met Arg 
5 10 

^j^Q^fr Cg ? K I P O RMATIOH POP, CDQ I P H O 4- 121 1 

(*) SEQUENCE OHADAO B EngO B IOO i- 
>^>li>TA) m i OPlli 10 ba se pain i 
v>l> ^jD) TYPH i nuel e io aoid 
" g ™(0) C i gnAHD B DHECC l s i ng le ■ 
(O) TOPOLOCY i 1 in o ar 

<4©0J> t^-i-^gQ^^^ DE60HimOM i- ^ EQ ID NO: 121 1 



6,007,988 

83 84 

-continued 



CGCAGAAGCC 10 



^2-[ Q> (2 ) lit FORMAT I0W FOR ODQ ID WO I 122 1 

<i) OCQUCWGC 1 CHARACTERISTICS i 
,/2/l> (ft) LPWCnil : 10 baao p al r o 
y »> X B ) WW nuoloAo acid Jpft/t 
S TRAM PB PME fiSi oinglo 

<4#Q> ( M i.) SE QUENCE DBCCHIPOHOM i CEQ ID H O I 122 1 

TCCAGAAGCC 10 

' {^iO^ ( 3 ) R E F O R M AT I Q M FOB fiEQ I P M 0» 123 1 
(!) 6 E QUBN CS GmyRiWCTTBICTICS l 

(ft) TifflOTTHi 10 hnwn p/ilra. 
~|fe-( D) g¥PD o nu c leic acid DAM 

■< D ) . TOPOLOGY i U near 

l $Qti> < vi > "Cwwc* pg&CBJpaioth erg aa ecu 1231 

CGCAGGTGAG 10 



'124s 

(i ) ■ OBQOBHO B- q HftWWPBMCT ICS I 
y^j^ (") rffNfTTHff 11 araim€>-acida 
>f|7> W TYPE r a m i n e —id PRl 



(P) TOPO L O G Y I untai cvn 

f * A ) MOL ECULE TTPRi p e pt i d e 

B - DCOeRIFPIOMi OEQ IP H Oir 124} 

Glu Gin Ly b Leu He Ser Glu Glu Asp Leu Asn 
5 10 



^^\0^ W ™^ BMItTT " M ™° "3 n M " - 125i 

( A ) s dqudiiod eHMwcMRtwrieai 

y^/L> (ft.) tiCliOTH : 7 emcfcnq acide- 
J?l*y> ( D) TYPE i anduu qui d— P/<T_ 

<r,rr (c ) a Tr uw D g D i i Eaai 1 
(D) TOP OL OGY i unVnown 

fA A ) MOLECULE TYPE I p e pt i d e 

/MOQ& CE g tfBWCB P B fiGniPaiOlI i C EQ i P H Q I 125t 

Pro Lye Lye Lye Arg Lys Val 



We claim: 

1. A library of DNA sequences, each sequence encoding 
a zinc finger polypeptide for display, the zinc finger polypep- 
tide comprising at least one zinc finger having partially 
randomised allocation of amino acids, the partially ran- 55 
domised zinc finger having a random allocation of amino 
acids at positions -1, +2, +3 and 4-6 and at least one of 
positions +1, 4-5 or +8, position 4-1 being the first amino acid 

in the a-helix of the zinc finger. 

2. A library according to claim 1, wherein the partially 60 
randomised zinc finger has random allocation of amino acids 

at each of positions 4-1, 4-5 and 4-8. 

3. A library according to claim 1, wherein the encoded 
partially randomised zinc finger comprises the zinc finger of 
the Zif 268 polypeptide. 

4. A library according to claim 1 as a fusion with a DNA 65 
sequence encoding the minor coat protein of bacteriophage 
fd. 



5. A method of designing a zinc finger polypeptide for 
binding to a particular target DNA sequence, comprising the 
steps of: 

comparing the binding to one or more DNA triplets of 
each of a plurality of zinc finger polypeptides having a 
partially randomized zinc finger, the zinc finger 
polypeptides being encoded by a library in accordance 
with claim 1; and 

selecting those nucleic acid sequences encoding ran- 
domised zinc fingers which bind to the target DNA 
sequence. 

6. A method of designing a zinc finger polypeptide for 
binding to a particular target DNA sequence, comprising the 
steps of: 

screening against at least a portion of the target DNA 
sequence, a plurality of zinc finger polypeptides having 
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